An Information-Theoretic Approach to Quantum Theory, II: 
The Formal Rules of Quantum Theory 

Philip GoyaO 

Cavendish Laboratory 
University of Cambridge 

In a companion paper [l[ (hereafter referred to as Paper I), we have presented an attempt to 
derive the finite-dimensional abstract quantum formalism from a set of physically comprehensible 
assumptions. In this paper, we formulate a correspondence principle, the Average- Value Correspon- 
dence Principle, that allows relations between measurement outcomes which are known to hold in a 
classical model of a system to be systematically taken over into the quantum model of the system. 
Using this principle, we derive the explicit form of the temporal evolution operator (thereby com- 
pleting the derivation of the abstract quantum formalism begun in Paper I) , and derive many of the 
formal rules (such as operator rules, commutation relations, and Dirac's Poisson bracket rule) that 
are needed to apply the abstract quantum formalism to model particular physical systems. 



I. INTRODUCTION 

In order to obtain a quantum mechanical model for a 
particular physical system such as a particle moving in 
space, it is necessary to supplement the abstract quantum 
formalism with formal rules which explicitly determine 
the form of the operators that represent particular mea- 
surements performed on the system, or that represent 
particular symmetry transformations (such as displace- 
ment or rotation) of the frame of reference in which the 
system is being observed. These formal rules usually sup- 
pose that measurements are described in the framework 
of classical physics, so that one speaks of "a measurement 
of observable (or property) A" and of an operator that 
represents such a measurement. These formal rules can 
be usefully classified as follows: 

(i) Operator Rules. Rules for writing down operators 
representing measurements of observables that are 
known functions of other, elementary, observables 
whose operators are given |27| . For example, such 
rules are needed to be able to write down the op- 
erator that represents a measurement of H, given 
the classical relation H — p x /2m + V(x), in terms 
of the operators x and p x which represent measure- 
ments of x and p x , respectively. 

(ii) Commutation Relations. Commutation relations 
between operators that represent measurements of 
fundamental observables such as position, momen- 
tum, and components of angular momentum. The 
commutation relations [x, p x ) = ih and [L x ,L a ] = 
ihL z , are the obvious examples, while Dirac's Pois- 
son bracket rule, [A, B] = ih{A, B}, is the more 
general rule for evaluating commutation relations, 
where {A, B} is the classical Poisson bracket for ob- 
servables A and B of a physical system, and {A, B}, 
A, and B are the respective operators. 
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(iii) Transformation Operators. Explicit forms of the 
operators that represent symmetry transformations 
of a frame of reference, such as the cc-displacement 
operator D x = —i d/dx. 

(iv) Measurement-Transformation Relations. The rela- 
tions between measurement operators and transfor- 
mation operators. For example, the x-displacement 
operator, D x , stands in the relation D x = p x /h to 
the cc-momentum measurement operator, p x . 

The physical origin of many of the above-mentioned 
rules is obscure. For example, although one can give sim- 
ple physical arguments [28| for the operator rule which 
states that, if a measurement of A is represented by op- 
erator A, then a measurement of a function, f(A), of A is 
represented by operator /(A), the generalization of such 
arguments to measurements of functions of two or more 
observables encounters severe difficulties due to the pos- 
sible non-commutativity of the operators that represent 
these observables. As a result of such difficulties, oper- 
ator rules tend to be heuristic and tend to lead to in- 
consistencies when applied to particular examples [29j . 
Similarly, the commutation relationship [x, p x ] — ih is 
typically obtained from Schroedinger's equation or from 
Dirac's Poisson bracket rule, both of whose derivations 
involve abstract assumptions whose physical origin is ob- 
scure. 

Recent work on the elucidation of the physical ori- 
gin of the quantum formalism either focuses exclu- 
sively on the derivation of the abstract quantum for- 
malism 0, i, 1, i, i 0, 1 I M El El, El or is con- 
cerned with the derivation of the Schroedinger equation 
directly from informational ideas without taking the ab- 
stract quantum formalism as a given [IJ, [l5|, [Ty| . Con- 
sequently, the question of what additional physical ideas 
are needed to obtain the formal rules described above 
given the abstract quantum formalism has received rela- 
tively little recent attention. 

Operator rules have been discussed in a few publica- 
tions, for example in [lTilllj], but a derivation of the oper- 
ator rules on the basis of a physical principle, taking the 
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abstract formalism as a given, has not been successfully 
completed. The most recent systematic attempt to de- 
rive the commonly employed commutation relations and 
measurement-transformation relations in such a manner 
is found in [H| (see also Refs. [H SJ). In [H|, the 
commutation relationships for the operators that repre- 
sent the Galilean group of transformations are first de- 
rived by exploiting the group structure of the classically- 
described transformations. Then, by establishing the re- 
lation between these transformation operators and par- 
ticular measurement operators, commutation relations 
for these measurement operators are obtained. However, 
this approach implicitly makes use of operator rules, and 
relies upon auxiliary assumptions, such as the assump- 
tion that certain measurement operators are unchanged 
by the action of particular symmetry operators, whose 
physical origin is unclear. 

In this paper, we show that, starting from the ab- 
stract quantum formalism, it is possible to derive the 
above-mentioned formal rules in a straightforward man- 
ner from a correspondence principle. Roughly speaking, 
this principle, the Average- Value Correspondence Prin- 
ciple (AVCP), asserts that, in a classical experiment, if a 
relation holds between the outcomes of a set of measure- 
ments performed on a physical system, then the same 
relation holds on average in a corresponding, suitably- 
defined quantum experiment on the same physical sys- 
tem, the average being taken over infinitely many trials 
of the quantum experiment. 

This paper is organized as follows. We begin in Sec. |TT] 
by formulating the AVCP. In Sec. EH the AVCP is used 
to obtain several generalized operator rules which con- 
nect the average values of operators at different times, 
from which the commonly used operator rules of quan- 
tum theory follow as a special case. Using the AVCP and 
Postulate 3.4 (from Paper I), we then derive the explicit 
form of the temporal evolution operator, which completes 
the derivation of the finite-dimensional abstract quantum 
formalism begun in Paper I. 

Next, in Sec. IIV1 taking the infinite-dimensional form 
of the abstract quantum formalism as a given, we use the 
AVCP to derive many of the commonly employed formal 
rules of quantum theory, namely (a) the commutation 
relations [x, p x ] = ih and [L^, L y ] = ihL z , and Dirac's 
Poisson bracket rule, (b) the explicit form of the opera- 
tors for displacements and rotations, and (c) the relation 
between momentum and displacement operators, and be- 
tween angular momentum and rotation operators. 

Finally, in Sec. [V] we show that, in the func- 
tions f(xi) = ±cos(a% + b) and f(xi) = ±sin(axi + b) 
derived in Paper I, the signs can be taken to be posi- 
tive and the constants a and b can be taken to have the 
values a = 1 and 6 = without loss of generality. 

We note that the treatment of the formal rules is il- 
lustrative rather than exhaustive, so that only the most 
commonly encountered measurement and transformation 
operators which are needed to formulate non-relativistic 
and relativistic quantum mechanics have been discussed. 



Many other formal rules (such as the operators for Galilei 
transformations, temporal displacement, and discrete 
transformations such as spatial inversion) can be ob- 
tained by arguments which closely follow those presented. 
The paper concludes with a discussion of the results ob- 
tained. 



II. THE AVERAGE- VALUE 
CORRESPONDENCE PRINCIPLE 

A. Introduction 

Suppose that, as described in Paper I, a quantum 
model q(iV), of dimension N, has been constructed to 
describe an abstract experimental set-up consisting of a 
source of identical systems, a measurement set A, and 
an interaction set X. The measurements in A, and the 
degenerate forms of measurements in A [30|, are repre- 
sented by A-dimcnsional Hermitian operators (possibly 
with degenerate eigenvalues), and shall be called quan- 
tum measurements. 

Suppose that quantum measurement A, with opera- 
tor A, represents a measurement that is classically de- 
scribed as a measurement of some observable A (which 
we shall henceforth abbreviate to "quantum measure- 
ment A (or operator A) represents a measurement of A" ) . 
Suppose that we wish to determine whether there is a 
quantum measurement that represents a measurement of, 
say, A 2 and, if so, to determine the operator which can 
be said to represent this measurement. One can imagine 
that, classically, a measurement of A 2 is implemented 
by a process where a measurement of A is performed 
and the outcome is then squared. If this implementa- 
tion is described in the quantum model, it follows that, 
if the input state is an element, v^, of an orthonormal 
set of eigenvectors of A, the output state of the pro- 
cess is Vi and the observed result of the process is af, 
where Av^ = a^i (i = 1,2, ... ,N). According to this 
line of reasoning, it follows at once that the operator A 2 
therefore represents a measurement of A 2 . 

However, this simple argument does not readily gen- 
eralize. For example, suppose that quantum measure- 
ments A and B, with operators A and B, represent mea- 
surements of A and B, respectively. Suppose that we 
wish to determine the operator (if any exists) that repre- 
sents a measurement of A + B. Classically, one imagines 
that such a measurement is implemented by making mea- 
surements of A and B simultaneously on a system, and 
adding the outcomes. However, if [A, B] ^ 0, this imple- 
mentation cannot be described without modification in 
the quantum framework since the measurements cannot 
be performed at the same time, and the order in which 
they are performed is of potential significance. 

Now, the conventional operator rules of quantum the- 
ory assert that a measurement of A + B is represented by 
the operator A+ B. Although this seems entirely reason- 
able on a formal, symbolic level, it is not clear in what 
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physical sense A + B can be said to 'represent' the mea- 
surement since, whereas the eigenvectors and eigenvalues 
of A directly reflect the output states and outcome val- 
ues obtained when measurement A is performed on the 
system, the eigenvectors of A + B do not, in general, co- 
incide with the eigenvectors of cither A or B, and the 
eigenvalues of A + B do not, in general, coincide with 
the possible results of any plausible implementation (de- 
scribed in the quantum model) of a measurement of A+B 
that involves performing measurements of A and B and 
combining their outcomes. For example, a measurement 
represented by operator S x + S z on a spin-1/2 particle 
has possible outcomes ±h/y/2, whereas the possible re- 
sults of an implementation where a measurement of S x 
is followed by a measurement of S z (or vice- versa), and 
their outcomes are added, are ±h, 0. 

The above observations illustrate the difficulty of ob- 
taining a physical understanding of the operator rules of 
quantum theory even in simple cases of interest. Below, 
we shall formulate a physical principle which gives a clear 
physical meaning to the sense in which an operator can 
be said to represent a classically-described measurement, 
and, in the majority of cases of interest, uniquely deter- 
mines the operator which represents such a measurement. 



1. Implementations of classically-described measurements 

Consider again a measurement that, from the clas- 
sical standpoint, is said to be a measurement of A 2 , 
where A is some observable. Classically, one can imag- 
ine a measurement of A 2 being implemented in one of 
three different ways (see Fig. [T]): (i) make a measure- 
ment of A on one copy of the system, and square the 
outcome; (ii) make two immediately successive measure- 
ments of A on one copy of the system, and multiply the 
two outcomes; or (iii) make two simultaneous measure- 
ments of A on two separate copies of the system prepared 
in the same state, and multiply the two outcomes. Al- 
though the first of these implementations is the one we 
have considered above, all three implementations yield 
the same result when modeled classically, and so can be 
regarded as equally valid implementations of a measure- 
ment of A 2 . 

Now, perhaps surprisingly, when described using the 
quantum model, these implementations do not, in gen- 
eral, yield the same expected results. Consider a quan- 
tum experimental arrangement employing the first im- 
plementation. In each run of the experiment, one copy 
of the system is prepared in some given state. Let 
the probability that a measurement of A yields out- 
come i (i = 1,2, ... , N), with value a i; be denoted P,. 
Then, the expected result is given by 



Expected result = ^(a;) 2 Pj 



(1) 
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FIG. 1: Three implementations of the measurement classi- 
cally described as "a measurement of A 2 ". In (i), a mea- 
surement of A is made on one copy of the system, and the 
outcome is squared to give the result; in (ii), two immedi- 
ately successive measurements of A are made on one copy of 
the system, and the result is obtained by multiplying the two 
outcomes; and (iii) two simultaneous measurements of A are 
made on two separate copies of the system prepared in the 
same state, and the result is obtained by multiplying the two 
outcomes. In a classical model of this situation, each imple- 
mentation yields the same result. However, in the quantum 
model of these implementations, the expected result of (i) 
and (ii) is a 2 whereas the expected result of (iii) is (a) 2 . 



One finds that implementation (ii) yields the same ex- 
pected result. However, in implementation (iii), where, 
in each run of the experiment, two copies of the system 
are prepared in the same state, one obtains 



Expected result = /J(< 

i,i 

= («) 2 , 



(2) 



with Pi and Pj denoting respectively the probabilities 
that the measurements of A on the first and second copy 
yield outcomes i and j = 1, 2, . . . , N). 

In this example, the differences between the implemen- 
tations as viewed in the quantum model arise due to the 
fact that, in the quantum framework, an immediate rep- 
etition of a measurement on the same copy of a system 
is different from performing a second simultaneous mea- 
surement on a separate, identically-prepared, copy of the 
system. 

In the case of a measurement that is classically de- 
scribed as a measurement of A + B, where measurements 
of A and B are assumed to occur at the same time, one 
needs to take into account the additional fact that, in 
the quantum framework, the order in which two mea- 
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surements is performed is of possible importance. Ac- 
cordingly, one can imagine implementing a measurement 
of A+B in one of at least three different ways (see Fig. [2]): 

(i) make a measurement of A, and then a measurement 
of B, on one copy of the system, and add the outcomes; 

(ii) make a measurement of B, and then a measurement 
of A, on one copy of the system, and add the outcomes; 

(iii) make a measurement of A on one copy of the system 
and, simultaneously, a measurement of B on a second 
copy of the system prepared in the same state as the first 
copy, and add the outcomes. Once again, in a classical 
model of this situation, the results agree. However, if one 
calculates the expected results in the quantum model, 
one finds that, if [A, B] ^ 0, all three will, in general, 
disagree with one another. 
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FIG. 2: Fhree implementations of the measurement classi- 
cally described as "a measurement of A + B" . In (i) , a mea- 
surement of A, and then a measurement of B, is made on 
one copy of the system, and the outcomes are then added to 
give the result; in (ii), a measurement of B, and then a mea- 
surement of A, is made on one copy of the system, and the 
outcomes are then added to give the result; and (iii) simul- 
taneous measurements of A and B are made on two separate 
copies of the system prepared in the same state, and the result 
is obtained by adding the two outcomes. In a classical model 
of this situation, each implementation yields the same result. 
However, in the quantum model of this situation, the three 
implementations do not, in general, yield the same expected 
results. 

As these examples illustrate, different implementations 
of the same classically-described measurement do not, in 
general, give the same expected result in the quantum 
model of the experimental arrangement. Thus, whereas 



a classical description of a measurement, such as "a mea- 
surement of x 2 " , adequately describes the intended mea- 
surement insofar as the outcome is concerned in the 
classical framework, such a description allows for more 
than one implementation which, in general, do not yield 
the same expected result when modeled in the quantum 
framework. 



2. The average-value condition 

The existence of different implementations of the same 
classically-described measurement immediately raises 
two questions. First, are these implementations, in some 
sense, equally valid in the quantum framework, or it 
is possible to find some reasonable physical basis upon 
which to select particular implementations and regard 
these as more fundamental than the others? Second, is 
it possible to find operators that represent the selected 
implementations, and, if so, do all the selected implemen- 
tations of a given measurement have the same operator 
representation? 

To answer these questions, we begin by observing that, 
although the above implementations are all regarded as 
bone fide measurements in the classical model, a measure- 
ment is only describable as such in the quantum model, 
and so can be called a quantum measurement, if it can 
be represented by a Hermitian operator which represents 
a single measurement performed upon one copy of the 
system at a particular time. So, for example, although 
implementation (iii) of a measurement of A 2 can be mod- 
eled in the quantum framework, the process as a whole 
cannot be described as a quantum measurement since it 
involves two separate measurements. In contrast, imple- 
mentation (i) of a measurement of A 2 can be described 
as a quantum measurement since it only involves a single 
measurement on one copy of the system. 

However, although an implementation modeled in the 
quantum framework that involves more than one mea- 
surement cannot itself be regarded as a quantum mea- 
surement, we can reasonably ask whether it is possible 
to find a quantum measurement, C, with operator C, 
which, in some sense to be determined, can nonetheless 
be said to represent the implementation. 

At the outset, we note that it makes no sense to re- 
quire that measurement C yield the same outcome as a 
given implementation since measurement outcomes are 
only probabilistically determined in the quantum frame- 
work. However, we can impose the simple condition that, 
over an infinite number of runs, the average value of mea- 
surement C should coincide with the average result ob- 
tained from the implementation for any initial state of 
the system. 

For example, in the case of an implementation of a 
measurement of A 2 , measurement C which, by hypothe- 
sis, represents the implementation, must be such that, for 
all states of the system, (C) is equal to the expected result 
obtained from the implementation. In the case of imple- 
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mentations (i) and (ii) described above, using Eq. {T]), we 
accordingly obtain the condition that the relation 



(Q 



(A 2 ) 



(3) 



must hold for all states, v, of the system. From this 
condition, we can conclude that 



C 



(4) 



In the case of implementation (hi), using Eq. @, we 
obtain the condition that the relation 



(C) = (a) 2 
= <A) 2 



(5) 



must hold for all v. By diagonalizing A, one can readily 
show that this condition implies that A is a multiple of 
the identity, which represents a trivial measurement that 
yields the same outcome irrespective of the state of the 
system. Therefore, implementation (iii) does not satisfy 
the above average- value condition in the case of any non- 
trivial measurement of A, and can therefore be reason- 
ably eliminated. Hence, in this case, the average-value 
condition is sufficiently strong so as to be able to pick 
out implementations (i) and (ii), and, since the average- 
value condition also implies that these implementations 
are both represented by the operator, A 2 , one can unam- 
biguously conclude that a measurement of A 2 is repre- 
sented by the operator A 2 . 

Proceeding in a similar way, restricting ourselves for 
the time being to measurements A and B that are not 
sub-system measurements [3ll |. one finds that, in the case 
of a measurement of C = A+B, only implementation (iii) 
is possible if [A, B] ^ 0, which then yields the operator 



C = A + B. 



(6) 



If [A, B] — 0, then all three implementations are possible, 
and all yield the same operator, C, as above. 

Finally, in the case of a measurement of AB, 
with [A, B] = 0, one finds that the implementation must 
the one where measurements A and B are performed on 
the same copy of the system, in which case the opera- 
tor AB is obtained. 

Hence, we see that the average- value condition is suffi- 
cient to yield a unique operator representation for the 
measurements considered above. Based on the above 
considerations, we can tentatively formulate the follow- 
ing general rule: in the case of a measurement which has 
an implementation that contains two elementary mea- 
surements (not sub-system measurements) represented 
by commuting operators, it is possible to find a quantum 
measurement that represents the implementation if the 
two elementary measurements are performed on the same 
copy of the system; but, when the operators do not com- 
mute, the elementary measurements must be performed 
on different copies of the system. 



We now consider implementations of a measurement 
of AB in the case when [A, B] ^ 0. The general rule just 
given suggests that we should consider the implementa- 
tion of this measurement where the measurements of A 
and B are performed on different copies of the system. 
This implementation has the expected value 



]T5>^)p,p; = <a)<b) 



(7) 



with Pi and Pj denoting respectively the probabilities 
that the measurement of A on the first copy and the 
measurement of B on the second copy yield outcomes i 
and j j = 1, 2, ... , N), and aj, bj respectively denot- 
ing the values of the ith and jth outcomes of measure- 
ments A and B. Imposing the above average- value condi- 
tion, the operator C that represents this implementation 
must satisfy the condition 



(C) = (A)(B) 



(8) 



for all v. However, for non-commuting A and B, one finds 
that C cannot be found such that this relation is satisfied 
for all v. We note, however, that with 



C = i(AB + BA), 



(9) 



equation (jHJ holds for the eigenstates of A and the eigen- 
states of B, which suggests that we may be able to weaken 
the average-value condition in this case so that we only 
require that Eq. ([8]) holds for these eigenstates. However, 
as we shall illustrate later (Sec. MI A the application 
of Eq. © can lead to inconsistencies. Consequently, we 
conclude that, from the point of view of the average- 
value condition, it is not possible to find an operator that 
represents a measurement of AB when the operators A 
and B do not commute. More generally, we find that, 
when [A, B] 7^ 0, it is necessary to exclude measurements 
of f(A, B), with / analytic (so that / has a well-defined 
polynomial expansion in A and B), where the polynomial 
expansion of / contains product terms. 

Finally, in the case of a measurement which has a clas- 
sical implementation which contains two elementary mea- 
surements that are performed on different sub-systems of 
a composite system, one finds that the implementation 
can be represented by a quantum measurement irrespec- 
tive of whether the two measurements are performed on 
the same or on different copies of the system, and that 
the different possible implementations are represented by 
the same operator. 



3. Generalizations 

In the examples above, we have considered implemen- 
tations of a classically-described measurement in which 
the measurement and the elementary measurements in 
the implementation are performed at the same time, and 
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in the same frame of reference. However, as the following 
examples illustrate, these are unnecessary restrictions. 

First, classically, for a non-relativistic particle of 
mass m, one can implement a measurement of x per- 
formed at time t + St by a process where measurements 
of x and p x are performed at time t, and the func- 
tion x + p x St /m is then computed. 

Second, one can implement a measurement of x' on a 
particle in the reference frame S' that is displaced along 
the x-axis relative to frame S by performing a measure- 
ment of x in frame S, and computing the appropriate 
function x' = fix) that relates x and x' . 

The above considerations regarding the implementa- 
tion of a classically-described measurement are applica- 
ble without change to the case where the measurement 
and the elementary measurements in its implementation 
are performed at different times or in different frames 
of reference. Below, we shall accordingly generalize our 
notion of the implementation of a classically-described 
measurement. 



B. Statement of the Principle 

We shall now state a general principle which incor- 
porates the observations made above concerning the 
average- value condition. An illustrative example is given 
in Fig. El 

Average- Value Correspondence Principle 

Consider a classical idealized experiment in which 
a system (possibly a composite system) is pre- 
pared in some state at time to, and is allowed 
to evolve in a given background. Suppose that 
a measurement of A^ m ^ (m > 2), performed on 
the system at time t% with outcome a^ m \ has a 
classical implementation in which measurements 
of A^ x \ A^ 2 \ . . . , y4.( m_1 ) are performed upon one 
copy of the system at time t\, and their respective 
outcomes, denoted cS 2 \ . . . , a^ m ~ x \ are then 
used to compute the result f{a^ l \ cS 2 \ . . . , a'™ -1 )), 
where / is an analytic function, so that the relation 

fl H = ;( fl (i) )(I {J) ) ... )a M) (*) 

holds for all initial (classical) states of the system. 

Consider the case where the quantum mea- 
surements A^ 1 ), AS 2 \ . . . , A^ m \ with opera- 
tors A (1) , A (2) , . . . , A (m) , represent the mea- 
surements of A^ 2 \ . . . , A^ m \ respectively. 
Then, consider the following idealized quantum 
experimental arrangement consisting of several 
set-ups, each consisting of identical sources and 
backgrounds, where, in each set-up, a copy of the 
system is prepared in the same initial state, Vo, at 
time to- 
rn one set-up, only measurement A^ 1 ™) is per- 
formed (at time £2) and, for any i,j with i =/= j 



and i,j < m — 1, the measurements AW,A^' 
are performed (at time t\) in the same set-up in 
the quantum experiment if [A*- 1 -* , A^l = pro- 
vided that, if the system is composite, the mea- 
surements are performed on the same sub-system; 
and are performed (at time t{) in different set-ups 
if [A W ,A W ] =^0. 

Let the outcomes of the measure- 
ments AW, . . . , A^" 1 ) in any given run of the ex- 
perimental arrangement be denoted 
respectively. The function f{a^\ a (2) , . . . , a {m - 1] ) 
is defined as simple provided that its polynomial 
expansion contains no terms involving a product 
of eigenvalues belonging to measurements whose 
operators do not commute. If / is simple, then (fej) 
holds on average, the average being taken over an 
infinitely large number of runs of the experiment. 

The above principle can be generalized in a number of 
ways, for example to the case where the measurements 
of A^\ . . . , yl( m_1 ) are not performed at the same time. 
However, these generalizations are unnecessary for the 
derivations of the usual formal rules of quantum theory, 
and are therefore not discussed here. 



III. GENERALIZED OPERATOR RULES AND 
THE TEMPORAL EVOLUTION OPERATOR 

A. Generalized Operator Rules 

We will now apply the AVCP to derive operator rela- 
tions which hold when the function / takes various useful 
forms. In each instance of /, we shall first derive a gen- 
eralized operator rule which relates the expected values 
of the relevant operators at different times. Then, taking 
the special case when the expected values are computed 
at the same time, we obtain the corresponding operator 
rule which relates the operators themselves. 

We shall consider a classical experiment where a sys- 
tem is subject to measurements of A and B at time t%, 
and to a measurement of C at time ti. We shall suppose 
that measurement of C, with outcome c, can be imple- 
mented by an arrangement in which the measurements 
of A and B are performed, with respective outcomes a 
and b, and the function /(a, b) then computed, so that 
the relation 

c = fia,b) (10) 

holds for all initial states of the system. 

In a quantum model of the appropriate experimental 
arrangement, let the operators that represent these mea- 
surements be denoted A, B, and C, respectively. To sim- 
plify the presentation, we shall only consider the case 
where these operators have finite dimension, N; the re- 
sults obtained below can be readily shown to apply in the 
infinite dimensional case. Let the elements of orthonor- 
mal sets of eigenvectors of A, B and C be denoted Vj, v! '■, 
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0) 



System 1 



0e 



System 2 



B lb, 



f(a,b) 
I 

Equal 



c 





System 1 




System 2 




System 3 









System 3 



f(a,,bj) 



System 3 



Equal 

on 
average 

I 



Case 1. f is a function of a only. 

In this case, the quantum experiment simply consists of 
two identical set-ups, involving two copies of the system, 
where A is performed on one copy at time t\ and C on 
the other copy at time 1%. Since function / is simple, by 
the AVCP, the relation 



(ii) 



holds for all initial states, Vo, of the system. Explicitly, 
Pi 



I t i 2 



d'I I "t 

p k = v fc 'v t 



2 

2 I ) 



(12) 



with v t being the state of the relevant copy of the system 
at time t. Hence, we can write Eq. (fTTj) as 

£«W)vt a = ^ f E V * V I-«W ( 13 ) 



Noting that 



/(A)=^v iV J/(a i 

i 



(14) 



(ii) 



FIG. 3: An example of the application of the AVCP. (i) A 
classical experiment showing the measurements of A, B and C 
performed at times 4a, is, and tc, respectively, with outcomes 
denoted as a, b, and c, respectively. Here, tA = tB — t\ 
and tc = t2- Suppose that one finds that the relation c = 
f(a, b) holds for all initial states of the system, (ii) The cor- 
responding quantum experiment. Three copies of the system 
are prepared in the same initial state, vo, at time to, and are 
placed in identical backgrounds. In this example, it is as- 
sumed that the operators A and B do not commute. Hence, 
by the AVCP, measurements A and B are performed on dif- 
ferent copies of the system. Measurement C is performed on 
a separate copy of the system. In any given run of the experi- 
ment, the probabilities that measurements A, B and C yield 
outcome values a,i,bj and (i,j,k = 1,...,N), axe Pi,Pj 
and Pit , respectively. The AVCP then asserts that, pro- 
vided the polynomial expansion of f(a, b) contains no product 
terms involving a and b, the relation c = f(a, b) holds for all 
initial states, vo, of the system, where the average is taken 
over an infinite number of runs of the experiment; that is, 

J2 k c k PL' 



Ey /(a 



)P i P' j for all v . 



and Vfc, respectively k — 1, 2, ... , N), let the corre- 
sponding eigenvalues be denoted <Zj, bj and Cfc, and let 
the probabilities of the ith, jth and A;th outcomes of 
measurements A, B and C in any given experimental 
arrangement be denoted by Pi, Pj and P^', respectively. 



we obtain the relation 

(Q* 2 = </(A)> tl , 



(15) 



which holds for all Vo. We can summarize the above result 
in the form of the generalized function rule: 



c(ta) = /(a(ti))"(C) ta = (/(A)> ti Vv , 



(16) 



where, for clarity, the times at which the outcomes are 
obtained has been explicitly indicated. In the special case 
where t = t\ = £2, we obtain the usual operator rule, the 
function rule: 



c = f(a) ~ C = /(A). 



Case 2. f(a,b) = fi(a) + f 2 (b) 



(17) 



It is necessary to consider three sub-cases. First, if 
measurements A and B have commuting operators and, 
in the case of a composite system, if they are sub-system 
measurements performed on the same sub-system, then, 
by the AVCP, they are performed on the same copy of 
the system in the quantum experiment. Since / is simple, 
the AVCP applies, so that 



(18) 



E c ^ = E ^) + E M^PjM ) v 
k i \ j 

= S(/i(ai) + /a(6i))^. 
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holds for all initial states, vo, of the system. Here, the no- 
tation -P'^ is the probability that measurement B yields 
outcome j given that A has yielded outcome i; in this 
case, Pju = Sij. From the above relation, we obtain the 
generalized operator relation 

(Qt a = (/i(A)> ti + (/ 2 (B)> tiJ (19) 

which holds for all initial states, Vo. In the special case 
where t\ =t%, we obtain the operator relation, 

C = / X (A) + / 2 (B). (20) 

Second, in the case where measurements A and B arc 
sub-system measurements performed on different sub- 
systems, they can, by the AVCP, be performed on the 
same copy of the system, in which case we obtain the 
same results as above. 

Third, if measurements A and B have non-commuting 
operators, then, by the AVCP, they are performed on 
different copies of the system in the quantum experiment. 
Since / is simple, the AVCP again applies, so that the 
relation 

E c * p * = E fMPi + E f*W ( 21 ) 

k i j 

holds for all initial states of the system, which yields the 
same relation as in Eq. I|19j) . 

Hence, combining the foregoing three sub-cases, we ob- 
tain the generalized sum rule: 

c(t 2 )=f 1 (a(t 1 ))+f 2 (b(t 1 )) 

h+ (C) t2 = </i(A)> ti + </ 2 (B)> 4i Vv . (22) 

In the special case where t = t% = t 2 , we obtain the sum 
rule: 

c = A (a) + h{b) ~ C = A(A) + / 2 (B). (23) 

More generally, consider a classical experiment where 
measurements of , A^ , . . . , A^" 1-1 ^ are performed on 
a system at time t\ and a measurement of A^ at time t 2 , 
with outcomes . . . ,aS m \ respectively. Suppose that 
the relation 

a M = /( (i),a( 2 ),...,a^- 1 )), (24) 

where 

/(a« aW . . . , a^- 1 )) = h(a^) + ■■■ + f m -i(a^-% 

(25) 

holds in the classical model for all initial states, and that, 
in the quantum model, the measurements are represented 
by the operators A (1) , A (2) , . . . , A (m) , respectively. Then, 
one finds that the AVCP implies the generalized operator 
rule: 

Tit— 1 

a< m >(t 2 ) = £/i(° C0 (*i)) 

1=1 

m — 1 

~<A (m \ = E</'( AW )> tl Vv °' ( 26 ) 
1=1 



Taking the special case of simultaneous measure- 
ments (ti = t 2 ), we obtain the operator rule 

m— 1 m— 1 

a M = E fi A« = E Si (A W )- (27) 
i=i i=i 

Case 3. f(a,b) = /i(a)/ 2 (6) 

We again consider three sub-cases. First, if measure- 
ments A and B are represented by commuting operators 
and, if the measurements are sub-system measurements 
and are performed on the same sub-system, then, in the 
quantum experiment, they are performed on the same 
copy of the system. Since / is, therefore, simple, the 
AVCP applies, so that the relation 

k 4J (28) 

= E-M a *)-f 2 (W 

i 

holds for all initial states, v , of the system. Hence, the 
generalized operator relation 

(Q t2 = </i(A)/ 2 (B)) ti (29) 

holds for all Vo. In the case where measurements A, B, 
and C arc simultaneous, we obtain the operator relation 

C = /i(A)/ 3 (B). (30) 

Second, if measurements A and B are represented by 
commuting operators and are sub-system measurements 
performed on different sub-systems of a composite sys- 
tem, then they can be performed on the same copy of 
the system in the quantum experiment, in which case we 
obtain the same result as above. 

Third, if measurements A and B are represented by 
non-commuting operators, then the function / is not sim- 
ple, and the AVCP does not apply. 

We can combine the three foregoing sub-cases to obtain 
the generalized product rule: 

c(h)=fi(a(ti))f 2 {b(t 1 )) 
^ (Q t2 = </i(A)/ 3 (B)) ti Vv if[A,B] = 0. (31) 

In the special case where t\ = t 2j we obtain the product 
rule: 

c=/i(o)/ 2 (&)^C = /i(A)/ a (B) if[A,B]=0. (32) 

1. Some comments on inconsistencies 

As mentioned in Sec. Ill A 21 if the average- value con- 
dition is weakened to allow a measurement of AB to be 
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represented by an operator in the case where [A, B] 7^ 0, 
one is lead to a rule that is often stated, namely 

h{a)h{b) -> i(/i(A)/ 2 (B) + A(A)/ 2 (B)) (33) 

However, using this rule, one finds that inconsistencies 
quickly arise. For example, one can first apply this rule to 
find that the operator representing a measurement of AB 
is 

A~B = (AB + BA)/2, (34) 

where the notation X is used to denote the operator that 
represents a measurement of X. One can then apply the 
rule a second time to find the operator that represents a 
measurement of A 2 B. By treating this measurement as a 
measurement of A(AB), or as a measurement of (A 2 )B, 
one obtains, respectively, either 

A(AB) = i(AAB + ABA) 

= - (A(AB + BA) + (AB + BA)A) (35) 
= - (A 2 B + 2ABA+ BA 2 ) , 

or 

(A*)B= -{A^B + BA 2 ) 

I (36) 
= -(A 2 B + BA 2 ), 

which are, in general, inequivalent. Hence, the average- 
value condition cannot be applied to non-simple functions 
of observables, even in weakened form, without leading 
to inconsistencies. 

We also remark that, given the AVCP, it cannot con- 
sistently be maintained that every classically-described 
measurement is represented by a quantum measurement, 
since, under this assumption, the function and sum 
rules can be applied to a measurement of (A + B) 2 , 
with [A, B] 7^ 0, to derive Eq. (JM]) as follows. First, defin- 
ing d = a + b, we use the sum rule to find D = A + B, 
and then use the function rule to find that 

D 2 = D 2 

= A + AB + BA + B . 

Second, since, by assumption, a measurement of AB is 
represented by a quantum measurement, we can use the 
sum rule directly to find a measurement of D 2 = A 2 + 
2AB + B 2 : 

D 2 = A 2 + 2AB + B 2 

(38) 

= A 2 + 2AB + B 2 . 

Equating these expressions for D 2 , we obtain Eq. (f3"4"| . 
which, as we have seen, leads to an inconsistency. If 



the AVCP is accepted as valid, this inconsistency can 
only be avoided if we conclude that a measurement 
of AB (when [A, B] 7^ 0) cannot be represented by a 
quantum measurement, in which case the sum rule can- 
not be applied to obtain Eq. (f38|) . 

In summary, given that measurements of A and B 
are represented by quantum measurements A and B, 
one can use the AVCP to find quantum measurements 
that represent measurements of f(A), f\ (A) + / 2 (B) and, 
for [A, B] = 0, of /i(^4)/ 2 (£?); and, more generally, one 
can find quantum measurements that represent measure- 
ments of f(A, B) when / is simple. The AVCP also im- 
plies that a measurement of AB when [A, B] 7^ cannot 
be represented by a quantum measurement. However, 
this does not appear to be a significant restriction since 
such measurements do not occur in the Hamiltonian for 
a system of particles, in either non-relativistic or rela- 
tivistic physics, in which fundamental forces alone are 
acting. 

B. Temporal Evolution 

In this section, we will use the AVCP, together with 
Postulate 3.4 (see Paper I), to derive the explicit form of 
the temporal evolution operator for a system in a time- 
dependent background. 

Temporal evolution of the system is represented by a 
unitary transformation. Specifically, over the course of 
the interval [t, t + At], the state v(i) evolves as 

v(i + Ai) = U t (Ai)v(f), (39) 

where Ut(At) is the unitary matrix that represents tem- 
poral evolution of the system during [t, t + At]. 

Suppose now that the background of the system is 
time-independent during this interval. Then we shall 
write U t (At) as V t (Ai). Now, for < Ah + At 2 < At, 
and Aii, Ai 2 both positive, we have 

V t {Ah + At 2 ) = V t+ At 1 (A* 2 )Vt(Aii). (40) 

But the time-independence of the background implies 
that V t+Atl (At 2 ) = V t (Ai 2 ). Therefore, 

Vt(A*i+At 3 )=V t (At 2 )Vt(A*i), (41) 

which can be solved to yield 

V t (Ah) =exp(-iK t Ati), (42) 

where K t is a Hermitian matrix. 

To determine the nature of K t , we proceed as follows. 
In the classical model of a physical system in a time- 
independent interval [t,t + At], the classical Hamiltonian 
for the system is not explicitly dependent upon time dur- 
ing this interval. On the assumption that the classical 
Hamiltonian is a simple function of the observables of 
the system, and that the measurements of these observ- 
ables are represented by quantum measurements [32| , it 
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follows from the AVCP that the corresponding Hamilto- 
nian operator is also not explicitly dependent upon time 
during this interval, so that, in particular, 



which, upon expansion, yields 



H t = H 



t+At, 



(43) 



where H t denotes the Hamiltonian operator at time t. In 
addition, in the classical model, the total energy of the 
system is constant during this interval for all states of 
the system. Therefore, by the generalized function rule, 
the relation 



<H t > 



t — (Ht+Ai/t+Ai 



(44) 



holds for any state v. Hence, from Eqs. (|43l) and (|44|) . it 
follows that 



(Ht) 



(H t ; 



t+At 



(45) 



for all v. But 



(H t ) t+Af = (H t ) t + i([K t , H t ]) t At + 0(Ai 2 ). (46) 

Therefore, the commutator [K t , H t ] = 0, which im- 
plies that there exist N mutually orthogonal eigenvec- 
tors, Vi, . . . , vjv, which K t and H t share in common. In 
particular, the state Vj (j — 1,2, ... , N) is an eigenvector 
of H t , with some eigenvalue Ej. 

Now, if the system is in an eigenstate, Vj, with eigen- 
value kj, of K f , at time t, the state evolves as 



v(t + At) = e 



— ikj At 



v(f). 



(47) 



Therefore, during the interval [t,t + At], this state re- 
mains an eigenstate of H t , and is therefore a state of 
constant energy, Ej, during this interval. In addition, 
since evolution only affects the overall phase of the 
state, the observable degrees of freedom of the state are 
time- independent during this interval. But, by Postu- 
late 3.4, the state v(t), representing a system in a time- 
independent background of definite energy, Ej , whose ob- 
servable degrees of freedom are time- independent, evolves 
as 



v(i + At) = e-^ A */ a v(t). 



(48) 



By comparison of Eqs. (j47|) and (I48|) . we find that kj = 
Ej/a holds for j = 1,2, . . . , N , which implies that K< = 
Hi/a. Hence, in a time-independent background, any 
state v evolves as 



v(f + At) = exp (— £H t Ai/a) v(i). 



(49) 



In order to generalize to the case of temporal evolu- 
tion in a time-dependent background, we split the inter- 
val [t, t + At] into intervals of duration e, approximate the 
evolution during each of these intervals assuming that the 
background is time-independent, and then take the limit 
as e — > 0: 



Ut(At) = U t+At _ 



= lim{V t+A t- e (e 



(e)...U t+e (e)U t (e) 

V t+£ (e)V t (e)}, 



(50) 



so that 



U t (At) = /- -H t At + 0(At 2 ), 
a 



dv(t) , , , , 
ia— ¥- = H t v(i). 



dt 



(51) 



(52) 



The value of the constant a will be determined at the 
end of Sec. ITVAl 



IV. REPRESENTATION OF MEASUREMENTS 
AND SYMMETRY TRANSFORMATIONS 

In this section, we shall use the AVCP to obtain (a) the 
commutation relations [x, p x ] = ih and [L x , L y ] = 
ihL z (and cyclic permutations thereof), and a restricted 
form of Dirac's Poisson bracket rule, (b) the explicit form 
of the displacement and rotation operators, and (c) the 
relation between the momentum and displacement oper- 
ators, and between the angular momentum and rotation 
operators. 

From this point onwards, we shall take the infinite- 
dimensional form of the abstract quantum formalism as 
a given. 



A. Position, momentum, and displacement 
operators 

We shall proceed in five steps. First, by considering a 
particular system (a photon moving along the x-axis), we 
shall obtain the commutation relationship, [x, p x ] = ia. 
Second, we shall obtain the explicit co-ordinate rep- 
resentation of the displacement operator D x = ^4-- 
Third, we shall obtain the relationship D x = p x /a that 
holds between the displacement operator, D^, and p x , 
and thereby obtain the co-ordinate representation of p x . 
Fourth, we shall show that the relations obtained are 
generally valid. Finally, we shall make the identifica- 
tion a = h. 



1. The position-momentum commutation relationships 

Consider a photon moving in the -l-x-direction, where 
measurements of the x-component of position, the x- 
component of momentum, and the energy, are repre- 
sented by the operators x, p x , H, respectively. 

First, to determine the relationship of H to the op- 
erators x and p x , we make use of the fact that the re- 
lation H — cp x holds for all classical states (x,p x ) of 
the system, so that, from the function rule, it follows 
that H = cp x . 

Next, to obtain a relation between x and p x , we make 
use of the fact that, in the quantum model, the expected 



11 



value of x at time t + St can be calculated in two separate 
ways. First, from the definition of (x) t , the relation 

(x) t+St = (Ui(St)xU t (St)) t 



mst\ ( mst\\ ^ /r n 



(x) t + -St{Hx-xH) t + 0(St 2 ) 
a 

(x) t -^St([x lPx ]) t + 0(St 2 ) 



(53) 



holds for all states, v, of the system. Second, using the 
generalized function rule, it follows from the classical re- 
lation x(t + St) = x(i) + cSt + 0(St 2 ) that the relation 



(x) t+st = (x) t +cSt + 0(St 2 ) 



(54) 



holds for all v. 

Equating the above two expressions for (x) t +st, we ob- 
tain 



[x, p x ] v = ia 
for all v, which implies that 

[x, p x ] = ia. 



(55) 



(56) 



Although a particular system has been used to obtain 
this commutation relation, we shall later argue that it is 
generally valid. 



2. Co-ordinate representation of the displacement operator. 

Suppose that, in frame S, the system is in state ip( x )- 
The probability density function over x' in the frame S' , 
which is displaced a distance — e along the x-axis, can be 
calculated in two equivalent ways, according to whether 
the transformation from frame S to S' is treated as 
a passive or active transformation. Accordingly, the 
probability density function over x 1 can be obtained by 
performing measurements of x 1 in frame S' upon the 
system in state ip{ x )y or by performing measurements 
of x in frame S upon the system in the transformed 
state, exp(— ieD x )ip(x), and substituting x for x' in the 
resulting probability density function over x. 

First, in frame S', let us calculate the probability den- 
sity function over x' directly. In this frame, the opera- 
tor x' represents a measurement of x' . In the classical 
model of the system, the relation 

x' = x + e (57) 

holds for all states (x,p x ) of the system. Hence, by the 
function rule, we obtain the operator relation 

x' = x + e. (58) 

Hence, an eigenstate of x with eigenvalue x is an eigen- 
state of x' with eigenvalue x' = x + e. Therefore, if a 



measurement of x on a system in state ip(x) yields values 
in the interval [x,x + Ax] with probability |"0(x)| 2 Aa;, 
then a measurement of x' on a system in the same state 
yields values in the interval [x' , x' + Ax'] with probability 
density 



PT(x'\S',i ! (x)) = \^(x' -e)f 



(59) 



Second, in frame S, measurement x is performed on 
the system in the transformed state exp(— ieD x )ip(x), so 
that the probability density function over x is 

PY(x\S,exp(-ieD x )iP(x)) = \exp(-ieD x )tfj(x)\ 2 . (60) 



The probability density functions over x' in Eq. (( 59| 
and over x in Eq. (|60[) . must agree under the correspon- 
dence x <-*• x' . Hence 



^{x -e) = e l ^ x) exp(-ieD x )^(a;), 



(61) 



with <fi(x) being an arbitrary real-valued function of x, 
which is satisfied for any e if and only if <fi(x) = and 



D, 



ld_ 

i dx 



(62) 



3. The displacement-momentum operator relation. 

In a classical model, the state of a particle subject 
to measurements of ^-position and the ^-component of 
momentum, is given by (xoiPao) m some frame of refer- 
ence, S. Consider the following two experiments. 

In the first experiment, measurements of the x com- 
ponents of position and momentum of the particle are 
made in a reference frame, 5", that is displaced by a dis- 
tance e along the —x axis, resulting in the state (x',p' x ) 
of the particle relative to the co-ordinates of frame 5". 
According to the classical model, 



x = xo + e 

Px =PxO- 



(63) 



In the second experiment, the particle is displaced a dis- 
tance e in the +a;-direction, and measurements of po- 
sition and momentum are then performed in frame S, 
giving the state, (x,p x ), of the particle in frame S 
as (x + e,p x0 ). 

In classical physics, for all states of the particle, the 
state (a/,j4), determined by measurements in frame S' 
upon the undisplaced particle, is numerically identical to 
the state (x,p x ), determined by measurements in frame S 
upon the displaced particle. That is, 



(64) 



for all states, (xo,p x o), of the particle. 

Now consider a quantum model of the particle subject 
to measurements of x and p Xl and let the state of the 
particle be given by Vo in frame S. Consider the first ex- 
periment. From Eqs. (|63[) . it follows from the generalized 
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function rule that, in the quantum model of the particle, 
the relations 



(x'> = vjxvo + e 
(P'x) = v JPx v 0, 



(65) 



hold for all quantum states, Vo, of the system. 

In the second experiment, the displacement of the par- 
ticle is a continuous, symmetry transformation of the 
system, and therefore can be represented by a unitary 
transformation of the state, Vo, and, in particular, by the 
operator exp (— ieD x ), where D x is a Hcrmitian operator. 
To first order in e, measurements of x and p x performed 
on this state have expected values 



(x) = vj(l + ieD x )x(l - ieD x )\z 
(Px) = V J(! + i&x)p x {l ~ ieD x )v . 



(66) 



From Eq. by the generalized function rule, the 

average values (x 1 ) and (p' x ) of Eqs. (|65|) are respectively 
equal to the average values (x) and (p x ) of Eqs. (|66p for 
all vq. Hence, we obtain that the relations 



vj(l4 



f ieD x )x(l 
i&x)p x (l 



ieD x )\z Q 
ieD x )v 



v^xvo + 



(67) 



hold for all vo to first order in e, which yield the commu- 
tation relations 



[x, D x ] 
P-,DJ 



From Eqs. (JH]) and l[6"Sj) . it follows that 

[x, (D x - pja)] = 
[D aj (D x - pja)} = 0. 



(68) 



(69) 



Now, in the co-ordinate representation, the operators x 
and Da, are given by x and —iad/dx, respectively, and 
one can readily show that {x,— iad/dx} forms an irre- 
ducible set [23|. By Schur's lemma [3J|, it therefore fol- 
lows from Eqs. (|69|) that 



D X = ^+ 7 I, 



(70) 



where 7 is real since the operator (D^ — p x /a) is Hermi- 
tian. For a given displacement, e, the constant 7 results 
in the same overall shift of phase of any state, v, of a 
system, and therefore produces no physically observable 
effects on the system. Hence, 7 can be set equal to zero 
without any loss of generality, so that we obtain 



D X = K 

a 



(71) 



Analogous relationships for the displacement operator 
corresponding to displacements in the y and z directions 
can be obtained in a similar way. 

Finally, from Eqs. (JB^Jl and (J7TJ) , we find 



a d 
Px = Ida' 



(72) 



4- Generality. 

The representations of x- and p^-measurements have 
been obtained above by considering, in the first step, a 
quantum model of a particular physical system, namely 
a photon moving along the x-axis. In the general case 
of a photon moving in an arbitrary direction, measure- 
ments of x,y,z, and p x ,p y ,p z are sub-system measure- 
ments, and can therefore be represented in the model of 
the composite system consisting of a photon, subject to 
measurements chosen from a measurement set generated 
by a measurement of r — (x, y, z), by the operators x, y, z 
and —iad/dx, —iad/dy, —iad/dz, respectively. 

These representations of measurements of position and 
momentum are also more generally valid for other sys- 
tems, as we shall explain below. 

a. State- determined measurements. Suppose that, 
in the classical framework, a measurement of A is per- 
formed on a system, and the outcome is determined by 
the state of the system alone. That is, in particular, the 
outcome is independent of the background of the system 
or of any parameters (such as charge or rest mass) that 
describe intrinsic properties of the system. We shall then 
say that this measurement is a state- determined mea- 
surement. For example, the outcome of a position mea- 
surement on a particle is determined by the state of the 
particle, and is independent of whether or not the par- 
ticle is in an electromagnetic field and is independent of 
the mass or charge of the particle. In general, any mea- 
surement of an observable that is a function only of the 
degrees of freedom of the state of the system is a state- 
determined measurement. In contrast, a measurement 
of the total energy of a system is, in general, dependent 
upon not only the state of the system, but also upon the 
background of the system, and is therefore not a state- 
determined measurement. 

Now, consider two quantum models of two differ- 
ent physical systems, system 1 and system 2, in differ- 
ent backgrounds, with respect to the measurement set 
generated by measurements Ai and A2, respectively, 
where Ai and A2 represent a measurement of A per- 
formed on the respective systems. Suppose, further, that 
the two models have the same dimension. If the mea- 
surement of A is state-determined when performed on 
both systems 1 and 2, then, by the AVCP, it follows 
that (Ai) = (A2) holds for all states, v, where opera- 
tors Ai, A2 represent measurements Ai, A2, respectively. 
It follows at once that the operators Ai, A2 are identical. 

Hence, provided that two systems admit classical mod- 
els with respect to a measurement of A that is state- 
determined, and admit quantum models of the same di- 
mension with respect to measurements Ai and A2, the 
operators that represent Ai and A2 in the respective 
models must be identical. 

Therefore, if state-determined measurements of x 
and p x are performed on any system, then, in a quan- 
tum model of the system subject to measurements in 
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the measurement set containing quantum measurements 
that represent measurements of x and p x , where these 
measurements yield a continuum of possible outcomes, 
their representations are the same as those obtained 
above. Therefore, the commutation relations involv- 
ing x, p x and Da; are also generally valid. Similar con- 
clusions hold for measurements of y, z and p y ,p z . 

Therefore, in the case of a particle where the inter- 
action energy in the Hamiltonian is obtained from a 
scalar potential that is dependent on position only, in 
which case the measurements of position and momen- 
tum are state-determined, the above representations are 
valid. Below, we shall consider a physically important 
case where the measurement of momentum is not state- 
determined. 

b. Particle in a magnetic field. In the case of a 
charged particle in a magnetic field background described 
in the Hamiltonian framework, the state of the parti- 
cle is (x, x), but the generalized co-ordinates are taken 
to be (x, p), where p = rax + eA, where m and e 
are the mass and charge of the particle, respectively, 
and A = (A x ,A y ,A z ) is the vector potential. In this 
case, p depends both upon the state of the particle and 
the state of the background. Therefore, a measurement 
of p is not a state-determined measurement, and the 
foregoing argument cannot be used to argue that the 
operators representing the measurements of the compo- 
nents, p x ,p y ,p z , of p are those derived above. Instead, 
we reason as follows. 

First, for a particle with state (x, x) in a magnetic field, 
in the argument of Sec. lIV ATI the commutation relation 
for the x-component of the motion in Eq. (|56p becomes 

[x, mx] = ia, (73) 

Then, from p = mx + eA, the sum rule gives 

p x = mx + eA x (x, y, z) , (74) 

which, together with Eq. (|75|) . implies that 

[x, pj = ia, (75) 

as before. 

Second, we note that, in the classical framework, the 
momentum p as defined above is invariant under dis- 
placement of the reference frame. Therefore, Eqs. 
remain unchanged, and, using Eq. ([75)1 . we obtain = 
p x /a. Third, and finally, the argument leading to the co- 
ordinate representation of remains unchanged since 
the argument only involves measurements of position, 
which are state-determined measurements. Therefore, 
the explicit representation of p x remains that given in 
Eq. (|72|) , and similarly for the y- and z-components of 
the motion. 

5. Identification of a — h. 

At this point, having obtained explicit representations 
for position and momentum measurements, it is possi- 



ble to use the operator rules to write down the explicit 
Schroedinger equation for a structureless electron in a 
hydrogen atom. By solution of the equation, and by 
comparing the energy levels of the electron either with 
those found in Bohr's model or with those found by ex- 
periment, one can establish that the constant a is equal 
to h. 



6. Remark on applications. 

The formal rules derived above allow the quantum the- 
oretic modeling of a non-relativistic particle in an arbi- 
trary classical background consisting of gravitational and 
electromagnetic fields, which leads to the non-relativistic 
Schroedinger equation. In the case of a multi-particle 
system, the rules (not discussed here) for dealing with 
identical particles are, additionally, required. 

In addition, the above rules allow the modeling of a 
photon without consideration of polarization degrees of 
freedom (leading to a complex wave equation), a struc- 
tureless relativistic particle (leading to the Klein-Gordon 
equation), and a relativistic particle with internal de- 
grees of freedom (which, with the appropriate auxiliary 
assumptions, leads to the Dirac equation). 

B. Angular momentum and rotation operators 

We shall proceed in three steps. First, we shall obtain 
the commutation relation [L^, L y ] = ihL z , and cyclic per- 
mutations thereof, up to an additive constant. Second, 
we shall obtain the commutator relations that hold be- 
tween the rotation operators, R x ,R y ,R z , and the angular 
momentum operators, and shall then use these relations 
to determine the value of the additive constant. Third, 
we shall determine the relations that hold between the 
rotation and angular momentum operators, and indicate 
how the explicit representations of the angular momen- 
tum operators and rotation operators can be determined. 

1. Components of Angular Momentum. 

Consider an experimental set-up where, in the classical 
model of the set-up, measurements are performed upon 
a classical spin, with magnetic moment /x, which deter- 
mine the values of the rectilinear components of angular 
momentum of the system. Suppose that the measure- 
ments of the components of angular momentum along 
the x-, y-, and z-directions, and the measurement of en- 
ergy, are represented by the operators L x , L y , L z , and H, 
respectively. 

In particular, consider a set-up where a magnetic 
field, B, is applied to the spin. In the classical model 
of this set-up, the energy associated with the interac- 
tion is —fi ■ B. Since fi = qL/2m, where q and m are 
the charge and mass, respectively, of the spin, and L = 
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(L x , L y , L z ) is its angular momentum vector, the en- 
ergy can be written as — (q/2m)H ■ L. By the sum 
rule (Sec. MI Aj) . the quantum mechanical Hamiltonian 
is given by 



H 



2m 



(B x 



ByLy 



B 7 L 7 



(76) 



where B x , B y and B z are the rectilinear components of B. 

The application of a magnetic field to a classical spin 
causes its angular momentum vector, L, to rotate about 
the axis along which the magnetic field is applied by an 
angle that is proportional both to |B| and to the duration 
for which the field is applied. Let the rotation matrix cor- 
responding to a rotation about axis a be denoted R a {9), 
where 9 is the angle of rotation. From the properties of 
rotation matrices, it follows that 

R x {e)R y {e) - R y (e)R x (e) = R z {e 2 ) - /, (77) 

where e is an infinitesimal angle, and -R a (e) is an infinites- 
imal rotation which can be implemented by application 
of a magnetic field B along the axis a for some time St. 
Using this relationship, it is possible to deduce the com- 
mutation relations that hold between the quantum me- 
chanical operators, L x , L y , L z , in the following way. 

The unitary evolution corresponding to the application 
of a magnetic field B to a spin for a time St is 



U(St) =exp [ 



(78) 



Ui(at) 


= i - 


-L 

h x 


u 2 (*t) 


= i - 


-L 

h y 


U 3 (5t) 


= i - 


-L 

h ' 



If magnetic fields of equal strength are applied for equal 
times, St, along the x, y, and z-axes, respectively, the 
corresponding unitary evolution is given to first order 
in St, respectively, by 



(79) 



Define proj(v) as the operation upon the quantum 
state, v, of a spin which returns a three-dimensional vec- 
tor, (L), with components (La,), (L y ) and (L z ). 

If the application of a magnetic field, B = B z k, say, to 
a classical spin causes a rotation of L by angle 9, then, by 
the generalized operator rule in Eq. (|27p , in the quantum 
model of the spin, the application of the field rotates the 
vector (L) = proj(v) by the angle 9 about the z-axis. 
From Eq. (|77[) . it therefore follows that, for any v, 

proj(U 1 (e)U 2 (e)v) - proj(U 2 (e)U 1 (e)v) = 

proj(U 3 (e 2 )v)-proj(v). (80) 

Using the definitions 

v i = U 3 (e 2 )v = v + 5\ii 

v 2 = Ui(e)U 2 (e)v = v + <5v 2 

V3 = U 2 (e)Ui(e)v = v + 5v 3 , 



where 

r * 2i 

dvi — — — e L z v 

5m 2 = 
<5v 3 = 

equation ([80]) becomes 



1 

w 



proj(v + <5v 2 ) — proj(v + Sv^) — proj(v + Svi) — proj(v). 

( 81 ) 

Equating the ^-components of this equation, we obtain 

v t L ;c ((5v 2 - 5v 3 ) + (5v 2 - <5v 3 ) t L a; v = v^^^vi + <5v{L x v, 

. (82) 

and, inserting the explicit forms of the Svi, we obtain the 
commutation relation 



[L x , L, + -[U,L,,]] =0. 



(83a) 



Equating the y- and z-components similarly, one obtains 
the relations 



[Ly, L z + -[L x ,L y ]} =0 



[L z , L z + ^[L x ,L y ]] =0 



(83b) 
(83c) 



By inspection, the above commutation relations have the 
solution 



[La,, L y ] = ihL z + Z7il, 



(84a) 



where 71 is real constant since the operators ^L^Ly] 
and L z are hermitian. We shall later show that this so- 
lution is, in fact, the most general one. 

The discussion leading to this result can be repeated 
to yield the relations 



[L y , L z ] = ihL x + i7 2 l 
[L z , La,] = ihLy + 173!. 



(84b) 
(84c) 



In order to determine the values of 7-factors, we require 
the commutation relations between the rotation opera- 
tors, R x ,R y ,R z , and La,,L y ,L 2 , which we shall now de- 
rive. 



2. Rotation-angular momentum commutation relations. 

Let an infinitesimal clockwise rotation of a frame of 
reference by angle e about the z-axis be represented by 
unitary transformation exp(— ieR z ), where R z is Hermi- 
tian. 

Now consider a set-up where measurements of L x ,L y , 
and L z are performed on a system in the original and 



15 



in the transformed frame of reference. In the classical 
model of this situation, the outcomes of the measure- 
ments performed in the original (unprimed) and trans- 
formed (primed) frames are, to first order in e, related 
by 



(85) 



By the generalized operator rule in Eq. (|27l) , it follows 
that, in the quantum model of the situation, the relation 



4- Explicit form of angular momentum operators 







— e 




/(U 
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9 


<L„ 













(86) 

holds for all states, v, of the system. Using the relation 
(l' x ) = vt(l + ieR,)L s (l - ieR z )v + 0(e 2 ), (87) 
we find from Eq. (|86|) that 

(L x )+ie<[R„U]> = (L x )-e(L y ) 
for all v, which implies that 



[R z , L x ] = iL y . 

Proceeding similarly for (L^) and (L z ), we obtain 

[R z , L y ] = -iL x 
[Rz,\-z] =0. 



(89a) 



(89b) 
(89c) 



The commutation relations for R x and R y can be ob- 
tained by parallel arguments. 



3. Angular momentum commutation relations. 

Left-multiplying Eq. (|84c[) by R z , and applying 
Eqs. (|89a |) -(|89c )) . we obtain 

[L z ,L x ]R z -i[L y ,L z ] = (ihL y +i 73 \)R z +HL x . (90) 

Using the Eq. (|84c|) right-multiplied by R 2 , this implies 
that 



[L y , l z ] = ihl x . 



(91a) 



Parallel arguments applied to Eqs. (|84a|) and (|84b[) using 
the commutation relations between R x , R y and L x , L y , L z 
yield 



[L z , U] = ihly 
[U, Ly] = ihL z . 



(91b) 
(91c) 



From the classical relation 1? = L\ 
lows from the sum rule that L 2 = Li + 



L 2 it fol- 



- X + L y + Ll. Using this 
relation and the above commutation relations for L x , L y 
and L z , explicit representations of these operators for fi- 
nite N can be obtained and the irreducibility of the rep- 
resentations of L x , L y , L z can be shown [35[. Therefore, 
by Schur's lemma, the solution given in Eq. (|84ap is the 
most general solution of Eqs. (|83a| - (l83cl) . and similarly 
for the solutions given in Eqs. (|84b|) and (I84c| . 

Although the representations of L x , L y and L z have 
been obtained by considering a particular physical sys- 
tem, they are generally valid on account of the argument 
given in Sec. IIV A 41 Therefore the commutation rela- 
tions for L x , L y and L z are generally valid. 



5. Rotation-angular momentum relations, and explicit 
form of the rotation operators. 

Using the commutation relationships for L^, L y and L z , 
it follows from Eqs. (|89a[> — (|89c[) that (hR z — L z ) commutes 
with L x ,L y , and L z . Since {L,,, Ly, L z } is an irreducible 
set, it follows from Schur's lemma that 



HR Z - L z = 7 I, 



(92) 



where 7 is a real-valued constant since R z and L z are 
Hermitian. For any given e, a non-zero value of 7 re- 
sults in the same overall change of phase for all states 
transformed by exp(— ieR z ), and so cannot give rise to 
observable consequences. Hence, 7 can be set to zero 
without loss of generality. Therefore, R z — L z /h, and, 
similarly, = L x /h and R y — L y /h. 

Using the explicit representations of L^, L^, L z for any 
given dimension N, the explicit representation of the 
rotation operators follows at once from these rotation- 
angular momentum relations. The explicit co-ordinate 
representations of the rotation operators in the infinite- 
dimensional case can also be determined by an argument 
similar to that used earlier to determine the explicit form 
of the displacement operators. 



C. Commutators and Poisson brackets 

In this section, we shall obtain a relation between 
the Poisson Bracket, {A, B}, and the commutator [A, B], 
where A and B are the classical observables of a physical 
system describable in the classical Hamiltonian frame- 
work, and A, B are the operators that represent measure- 
ments of these observables. Dirac's Poisson Bracket rule 
asserts the relation 



[A, B] = ih{A, B}, 



(93) 



where {A, B} is the operator that represents a measure- 
ment of {A, B}. Below, we shall derive this relation using 
the AVCP in the case where B is the Hamiltonian. 
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Consider the Hamiltonian model of a system 
with state (qx, . . . , qNlPii ■ ■ ■ >Pn) where N > 1. 
The temporal rate of change of the func- 
tion F(qi, . . . , <?jv;Pi! • • • )Pn) is given in terms of 
the Hamiltonian, H(q±, . . . ,qN\Pi> ■ ■ ■ ,Pn), by 



F = {F,H} 



N 



dH dF 



\ dm dpi dq % dpi 



(94) 



Consider the quantum model of the system with state v, 
where the measurements of the qi and the pi are repre- 
sented by operators and p i5 respectively. If H is simple, 
then, by the AVCP, a measurement of H can be repre- 
sented by the operator H; otherwise, according to the 
AVCP, it is not possible to describe the temporal evolu- 
tion of the system in the quantum model. If both of the 
functions F and {F, H} are simple, then, by the AVCP, 

they are represented by the operators F and {F, H}, re- 
spectively, and from Eq. ([94]) . by the generalized function 
rule, the relation 



(F) t = ({F,H}) t , 



(95) 



holds for all v. 

Now, in the classical model, the function F is defined, 
for all states, as 



F= lim 

At^O 



F{t + At) - F(t) \ 
At J 



(96) 



If F(t) and F(t + At) are both simple, then, according to 
this definition, F(t) is also simple, and, using the gener- 
alized sum rule (regarding the measurement of F(t + At) 
as the one being implemented in terms of measurements 
of F(t) and F(t)), we obtain the relation 

(^) t = A l™^{(F) t+Ai -(F) t }, (97) 

which holds for all v, with the operator F representing a 
measurement of F. 

If the functions {F, H} and F(t) are both simple, then, 
since F(t + At) = F(t) + {F, H}At, it follows that F(t + 
At) is also simple. In that case, both Eqs. (|93|) and (f9"T|) 

hold for all v. Equating these two expressions for (F) t , 
we obtain the relation 



({F~H}) t = lim J- 
u J/t At^o At 



l + ~HAt) F( 1--HA* 
h j \ h 



-(F), 



= ^- 1 {[H,F]) t , 

which holds for all v, so that 
ih{FH} -- 



[F,H]. 



(98) 



(99) 



Hence, we obtain Eq. ([93)1 in the special case where B = 
H, subject to the condition that the functions A, B 
and {^4, B} are simple. 

Using this relationship, we can readily evaluate use- 
ful commutation relationships. For example, for a pho- 
ton with state (x,p x ), setting F — x, and H = 
cp x (the Hamiltonian for a photon moving along the +x- 
direction), we find {F, if} = c. Hence, since the func- 
tions F, H, and {F, H} are simple, Eq. (|99[) immediately 
gives [x, pj = ih. 

If one or more of the functions F, H, and {F, H} is not 
simple, then Eq. (|9"9"|) docs not follow from the above 
argument. To take a specific example, suppose that, 
for a system with state (x,p x ), where [x, pj = ih, we 
choose F = x 3 and H = jp x , where 7 is a constant. 
We can then apply the function rule to obtain the corre- 
sponding operators F = x 3 and H = -fp 3 ,, and use these 
to find 



[F, H] = 3ijh 



xp^x 



P^x 2 ) 



(100) 



However, the function {F, H} = 9jx 2 p x is not sim- 
ple, which implies that the AVCP cannot be used to 
write down an operator which represents a measurement 
of {F, H}. If we were nonetheless to apply the Hermiti- 
zation rule in Eq. (|33[) to a measurement of {F, H} (in 
spite of the inconsistencies to which we have shown this 
would lead) we would obtain 



ih{F, H} 



x 2 P 2 



P 2 x 2 ) 



(101) 



but this differs from [F, H] by the constant 27ft 3 . Since 
the expected value of r yhx 2 p 2 is itself of order 7ft 3 , there 
is no guarantee that the difference between Eqs. (|100|) 
and p01[) will be negligible. 

The question of whether Eq. (|93|) holds in the more 
general case where the function B cannot be treated as 
the classical Hamiltonian of the system is not discussed 
here. 



V. ARBITRARINESS IN THE 
FUNCTIONS f( Xi ) AND f( Xi ) 

In Paper I, we found that the functions f(Xi) 
and f(xi) (not to be confused the function / that forms 
part of the AVCP) are ± cos(axi + b) and ± sin(axj + b), 
respectively, where a, b € K and a ^ 0. Having obtained 
the temporal evolution operator and commutation rela- 
tionships, we are now able to show that the choices made 
in Paper I of the positive signs for / and / and of a = 1 
and b = do not involve a loss of generality. 

Let us first consider the case where positive signs are 
chosen for both / and /. With constants a and b left in 
place, the derivation given in Paper I is altered as follows. 
First, with Q a u = cos(a\i + b) and Q&u = sin(axi + b), 
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and defining Xi = a Xi + b, we obtain 
Q = (\fl\Qa\i-, V^Qfali, , • • ■ 




Pat sin xn), 



(102) 



where the Pi are the probabilities of the observed out- 
comes of the measurement, A, with respect to which Q 
is written. 

Second, the invariance condition (Postulate 3.2) re- 
quires that there is no change in the probabilities of the 
observable outcomes of measurement A performed on an 
evolved state if an arbitrary constant xo € K is added 
to each of the Xi in the initial state. Hence, in terms of 
the Xi, an arbitrary constant Xo may be added to each 
of the Xi- Therefore, from the argument given in Pa- 
per I, the state can, with respect to measurement A, be 
represented as the complex vector 
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(103) 



with physical transformations being represented as uni- 
tary or antiunitary transformations as found previously. 

Third, Postulate 5 implies that Xij = +xf \ which 
implies that the composite systems rule remains v = 
vW eg) v' 2 ), where vW,v( 2 ) are the states of the sub- 
systems, and v is the state of the composite system. 

Fourth, by inspection of the argument used above 
to find the temporal evolution operator, one finds 
that (it (At) = cxp(—iaHAt/a). In addition, the forego- 
ing arguments lead to p x — —i(a/a) d/dx. If one writes 
down the Schroedinger equation implied by these rela- 
tions, one obtains a/a — h, which implies that (it(At) = 
exp(—iHAt/h) as before. 

In summary, irrespective of the values of a, b, one ob- 
tains the same abstract quantum formalism. Further- 
more, although the constants a, b appears as a connec- 
tion between x% an d Qa\ii we shall now show that they 
are unimportant insofar as observable predictions of the 
formalism, namely the prediction of the values and prob- 
abilities of the observable outcomes of measurements, are 
concerned. To do so, we shall consider an experiment 
where a system is prepared using measurement A and, 
after undergoing a physical transformation (either active 
or passive), is subject to measurement A'. 

We shall first establish that the explicit form of a mea- 
surement operator that represents any given measure- 
ment is independent of a and b. First, consider the exam- 
ple of the operators x and p x . The explicit form of these 
operators is determined by (a) the commutation rela- 
tion [x, p x ] = ih, (b) the measurement-transformation re- 
lation D x = p x /h, and (c) the relation = —id/dx. In 
this case, one finds by inspection of the above derivation 
that these relations are all independent of a and b. More 
generally, the explicit form of measurement operators is 
determined by (a) commutation relationships between 



measurement operators, (b) the relation between mea- 
surement operators and transformation operators, and 
(c) the explicit form of transformation operators. First, 
since the Dirac Poisson bracket rule derived in Sec. IIV CI 
is independent of a and b, the commutation relations 
between measurement operators are independent of a 
and b. Second, the measurement-transformation rela- 
tions and the explicit form of the transformation rela- 
tions are directly obtained via the AVCP from the classi- 
cal relation between the outcomes of measurements per- 
formed in the original and transformed frames of refer- 
ence, and are therefore independent of a and b (as il- 
lustrated by the above arguments leading to D x = p x /h 
and Da, = —id/dx). Therefore, in general, measurement 
operators are independent of a and b. 

Next, we consider the operators that represent physical 
transformations. First consider active transformations. 
The general temporal evolution operator, (it(At) — 
exp(— iH t At/h) is not explicitly dependent upon a and b. 
The operator H t is a function of the measurement opera- 
tors (such as x, p x for a particle moving along the x-axis) 
relevant to the system, and is directly obtained (via the 
AVCP) from the classical relation between the classical 
Hamiltonian, H, and the classical observables that de- 
termine the state of the system, and is therefore also in- 
dependent of a and b. Since the measurement operators 
are themselves independent of a and b, it follows that 
the operator representing any given temporal evolution 
is independent of a and b. 

Second, the explicit form of operators representing pas- 
sive transformations (such as displacement or rotation of 
a reference frame) can be obtained by directly transpos- 
ing the relevant classical relations (which connect the co- 
ordinates in the original and transformed frames) into the 
quantum framework via the AVCP (as illustrated by the 
derivation of the co-ordinate representation of above) , 
and can be readily seen to therefore also be independent 
of a and b. Hence, a unitary or antiunitary operator that 
represents any given physical transformation is indepen- 
dent of a and b. 

Now consider the above-mentioned experiment where 
measurement A is performed and, say, outcome 1 is ob- 
tained. With respect to measurement A, the resulting 
state is v = (1, ... , 0) T up to an irrelevant overall phase. 
Suppose the system now undergoes a physical transfor- 
mation (either active or passive), and a second measure- 
ment, A, is performed. Since the physical transformation 
is represented by a unitary or antiunitary operator that is 
independent of a and b as noted above, the outcome prob- 
abilities of the second measurement are unaffected by the 
value of a and b. This example includes the general case 
where the second measurement is measurement A' since 
the latter can, by Postulate 1.2 (see Paper I), be repre- 
sented by an arrangement consisting of measurement A 
immediately preceded and followed by suitable temporal 
evolution of the system. Therefore, in the most general 
experiment we are considering, the outcome probabili- 
ties of the observable outcomes are also independent of a 
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and b. 

Finally, we note that, since the explicit form of the 
measurement operators is independent of a and b, their 
eigenvalues are also independent of a and b. Therefore, in 
summary, we find that both the probabilities and the val- 
ues of the observable outcomes of measurements in the 
above general experiment are independent of a and b. 
Therefore, the values of a and b can, without loss of gen- 
erality, conveniently be chosen to be a = 1 and 6 = 0. 

If one chooses the signs of / and / not to be both pos- 
itive, then this is equivalent to choosing positive signs 
for / and / but changing the values of a and b to some 
other values, a' and b' , respectively. Specifically, if one 
chooses the signs (+, — ), then a' = —a and b' = — b + it: 
if (—,+), then a' = —a and b' — —6; and, if (—,—), 
then a' = a and b' = b + n. But we have already shown 
that the choice of the constants a and b is unimportant, 
and hence the change in their values is unimportant. 
Therefore, the signs can, without loss of generality, both 
be chosen to be positive. 



VI. DISCUSSION 

The derivation presented in this paper has shown that, 
using a clearly-motivated physical principle, it is possi- 
ble to derive the explicit form of the temporal evolution 
operator given the postulates of Paper I, and to derive 
the formal rules of quantum theory in a systematic man- 
ner from appropriately chosen relations known to hold in 
classical physics. The derivation provides several physi- 
cal insights into the formal rules. 

The first insight is that the classical description of a 
measurement (such as 'a measurement of x 2 ') leaves open 
the possibility of more than one implementation, and 
that, when modeled in the quantum framework, these 
implementations are, in general, not equivalent. 

Second, it is possible to impose a simple average-value 
condition that must be satisfied by an operator that 
can be said to represent a classical implementation of a 
measurement. This condition implies that many imple- 
mentations cannot be represented by an operator, and 
can therefore be eliminated from consideration. That is, 
one finds that there are implementations which, although 
acceptable in the classical framework, cannot be repre- 
sented by operators in the quantum framework without 
violating a very mild average-value condition. 

Third, in the case of an implementation that satis- 
fies the average- value condition, the operator that repre- 
sents the implementation is uniquely determined by the 
average- value condition provided that the function, /, 
that describes the implementation, is simple. One also 
finds that those implementations that satisfy the average- 
value condition are are represented by the same operator, 
so that it is possible to represent a measurement of / by 
a unique operator. If / is not simple, then it does not ap- 
pear to be possible to apply the average- value condition, 
even in a weakened form, without inconsistencies arising. 



Fourth, we have found that the AVCP is incompati- 
ble with the assumption that every classically-described 
measurement on a system is represented by a quantum 
measurement in the quantum model of the system. For 
example, the AVCP implies that a measurement of AB 
does not have an operator representation if [A, B] ^ 0. 

The fifth insight rests on the fact that, rather sur- 
prisingly, the AVCP enables formal rules of each of the 
four types described in the Introduction (operator rules, 
commutation relations, transformation operators, and 
measurement-transformation relations) to be obtained 
in a uniform manner. Consequently, one can see that 
the difference between these types of rules depends sim- 
ply upon whether the classical relations that one is tak- 
ing over into the quantum framework are relations be- 
tween measurements performed at the same time (lead- 
ing to the operator rules), at different times (leading to 
commutation relations for measurement operators), or in 
different frames of reference (leading to measurement- 
transformation relations and to the explicit forms of 
transformation operators). In short, from the perspec- 
tive provided by the derivation, the commutation rela- 
tion [x, p x ] = ih is no more elusive in its origin than the 
operator relation H = p 2 ,/2m. 

Sixth, the derivation provides a clearer physical foun- 
dation to many particular formal rules that are com- 
monly used in quantum theory. For example, the commu- 
tation relation [L^, L y ] — ihL z is ordinarily derived in the 
infinite-dimensional quantum formalism for a particle (by 
transposing the classical relation L 2 = xp y — yp x , and 
cyclic permutations thereof, into the quantum framework 
using the operator rules), and is then assumed, without 
further justification, to also hold in the finite-dimensional 
case. Here, we have obtained this commutation relation 
directly for finite- and infinite-dimensional quantum sys- 
tems, and have done so in a manner that makes clear its 
connection with the properties of rotations. Similarly, a 
restricted form of Dirac's Poisson bracket rule has been 
derived in a systematic manner using the AVCP without 
making use of abstract analogies. 

Finally, we remark that the general notion of average- 
value correspondence is already familiar in elemen tary 
quantum mechanics through Ehrenfest's theorem [22j], 
which shows that the motion of a particle modeled in the 
quantum framework is, on average, approximated by the 
behavior of the particle when described classically. How- 
ever, the possibility that such a correspondence might 
serve as the basis for a constructive principle that allows 
the formal rules of quantum theory to be determined by 
appropriately-chosen classical relations does not appear 
to have been widely explored [36| . It has been shown here 
that it is possible to formulate the notion of average- value 
correspondence in the form of an exact physical principle 
which, roughly speaking, allows the logic of Ehrenfest's 
argument to be reversed, enabling the often physically 
obscure formal rules of quantum theory to be derived 
in a systematic manner from familiar relations known to 
hold in classical physics. 
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The basic operator rules of quantum theory 
are (i) Rule 1 (Function rule): If a measurement of A is 
represented by operator A, then a measurement of f(A) 
is represented by /(A); (ii) Rule 2 (Sum rule): If mea- 
surements of A and B are represented by operators A, B, 
respectively, then a measurement of fi(A) + /2(B) 
is represented by ,fi(A) + /2(B) 
more than two observables) , and 
uct rule): If measurements of A 
sented by commuting operators 
then a measurement of fi(A)f2(B) is represented 
by /1 (A) /2(B) (and similarly for more than two ob- 
servables). A more general rule that is often employed 
is: (iv) Rule 3' (Hermitization rule): If measurements 
of A and B are represented by operators A, B, respec- 
tively, then a measurement of fi(A)f2(B) is represented 
by (/i(A)/ 2 (B) + / 2 (B)/i(A))/2. The above rules are 
generally assumed to apply to arbitrary Hermitian 
operators. 

See HI (Sec. 11) and [r| (Sec. IVT), for example. 
See, for example, [l7[ (Sec. IV.l), H|] (Sec. 9.12—9.15), 
and [25|] (Sec. 5.2.1). We shall discuss one such example 
in Sec. MIA II 

A measurement that is a degenerate form of a measure- 
ment A is defined operationally in Paper I. Such a mea- 
surement has N' < N possible outcomes, and can be 
represented as an iV-dimensional degenerate Hermitian 
operator (with N' distinct eigenvalues). 
A sub-system measurement is defined in Paper I as a 
measurement performed on a single sub-system of a com- 
posite system. 

If the measurements of the observables of which the clas- 
sical Hamiltonian is a function are not represented by 
quantum measurements, then it is not possible to write 
down the operator corresponding to the classical Hamil- 
tonian. Similarly, if the classical Hamiltonian is not a 
simple function of the observables of the system, then the 
form of the operator that represents a measurement of en- 
ergy cannot be determined by the AVCP and, therefore, 
the explicit form of Ut(At) cannot be obtained from the 
argument given in the text. However, these assumptions 
do not appear to represent a significant restriction in any 
fundamental cases of interest (in both non-relativistic 
and relativistic cases). 
See UK], Appendix 2. 
See, for example, Ref. [2J], Appendix 1. 
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[35] See [I|(Ch. 10, Sec. 3), for example. 

[36] Refs. 1 171. Il8l . [2J| mention the general idea of average- 
value correspondence in their discussion of the op- 
erator rules of quantum theory. For example, Groe- 
newold [3 (Eqs. (1.32)-(1.34)) remarks that the sum 
rule is equivalent to a condition on the expectations of 
the respective operators, but the idea is not formulated 
in a manner that is sufficiently systematic to derive the 
operator rules, and no attempt is made to derive the 
any of the other types of formal rule (such as the com- 



mutation relations) using average-value correspondence. 
Bohm [24| clearly articulates the idea that average- value 
correspondence can be used as a constraint on quantum 
theory, and uses it to determine particular instances of 
the function rule (Sees. 9.5-9.21) and to determine the 
Hamiltonian operator that represents a non-relativistic 
particle (Sees. 9.24-9.26). However, the idea is not sys- 
tematically formulated and applied beyond these special 
cases. 



